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^ ' Abstract 



fS I We argue that certain apparently consistent low-energy effective field theories described by local, Lorentz- 
^ ■ invariant Lagrangians, secretly exhibit macroscopic non-locality and cannot be embedded in any UV 
^ • theory whose 5-matrix satisfies canonical analyticity constraints. The obstruction involves the signs 
• ^ . of a set of leading irrelevant operators, which must be strictly positive to ensure UV analyticity. An 
^ I IR manifestation of this restriction is that the "wrong" signs lead to superluminal fluctuations around 
d I non-trivial backgrounds, making it impossible to define local, causal evolution, and implying a surprising 
IR breakdown of the effective theory. Such effective theories can not arise in quantum field theories or 
weakly coupled string theories, whose 5-matrices satisfy the usual analyticity properties. This conclusion 
applies to the DGP brane-world model modifying gravity in the IR, giving a simple explanation for the 
difficulty of embedding this model into controlled stringy backgrounds, and to models of electroweak 
symmetry breaking that predict negative anomalous quartic couplings for the W and Z. Conversely, any 
experimental support for the DGP model, or measured negative signs for anomalous quartic gauge boson 
couplings at future accelerators, would constitute direct evidence for the existence of superluminality and 
macroscopic non-locality unlike anything previously seen in physics, and almost incidentally falsify both 
local quantum field theory and perturbative string theory. 



^On leave from INFN, Pisa, Italy. 



1 Introduction 



Can every low-energy effective theory be UV completed into a full theory? To a string theorist 
in 1985, the answer to this question would have been a resounding "no." The hope was that the 
consistency conditions on a full theory of quantum gravity would be so strong as to more or less 
uniquely single out the standard model coupled to GR as the unique low-energy effective theory, 
and that the infinite number of other possible effective theories simply couldn't be extended to a 
full theory. In support of this view, the early study of perturbative heterotic strings yielded many 
constraints on the properties of the low-energy theory invisible to the effective field theorist. For 
instance, the rank of the gauge group was restricted to be smaller than 22. 

With the discovery of D-branes and the duality revolution, these constraints appear to have 
evaporated, leaving us with a continuous infinity of consistent supersymmetric theories coupled 
to gravity and very likely a huge discretum of non-supersymmetric vacua P] . If the low-energy 
theory describing our universe is not unique but merely one point in a vast landscape of vacua of 
the underlying theory, then the properties of our vacuum — such as the values of the dimensionless 
couplings of the standard model — are unlikely to be tied to the structure of the fundamental theory 
in any direct way, reducing the detailed study of its particle-physical properties to a problem of 
only parochial interest. This situation is not without its consolations. With a vast landscape of 
vacua, seemingly intractable fine-tuning puzzles such as the cosmological constant problem [2], 
and perhaps even the hierarchy problem can be solved by being demoted from fundamental 
questions to environmental ones, suggesting new models for particle physics pi. 

Given these developments, it is worth asking again: can every effective field theory be UV 
completed? The evidence for an enormous landscape of vacua in string theory certainly encourages 
this point of view — if even the consistency conditions on quantum gravity leave room for huge 
numbers of consistent theories, surely any consistent model can be embedded somewhere in the 
landscape. Much of the activity in model-building in the last five years has implicitly taken 
this point of view, constructing interesting theories purely from the bottom-up with no obvious 
embedding into any microscopic theory. This has been particularly true in the context of attempts 
to modify gravity in the infrared, including most notably the Dvali-Gabadadze-Porrati model jS] 
and more recent ideas on Higgs phases of gravity [El El E] ■ 

In this note, we wish to argue that the pendulum has swung too far in the "anything goes" 
direction. Using simple and familiar arguments, we will show that some apparently perfectly 
sensible low-enegy effective field theories governed by local, Lorentz-invariant Lagrangians, are 
secretly non-local, do not admit any Lorentz-invariant notion of causality, and are incompatible 
with a microscopic S'-matrix satisfying the usual analyticity conditions. The consistency condition 
we identify is that the signs of certain higher- dimensional operators in any non-trivial effective 
theory must all be strictly positive. The inconsistency of theories which violate this positivity 
condition has both UV and IR avatars. 

The IR face of the problem is that, for the "wrong" sign of these operators, small fluctuations 
around translationally invariant backgrounds propagate superluminally, making it impossible to 
define a Lorentz-invariant time-ordering of events. Moreover, in general backgrounds, the equation 
of motion can degenerate on macroscopic scales to a non-local constraint equation whose solutions 
are UV-dominated. Thus, while these theories are local in the sense that the field equations derive 
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from a strictly local Lagrangian, and Lorentz-invariant in the sense that Lorentz transforms of 
solutions to the field equations are again solutions, the macroscopic IR physics of this theory is 
neither Lorentz-invariant nor local. 

The UV face of the problem is also easy to discern: assuming that UV scattering amplitudes 
satisfy the usual analyticity conditions, dispersion relations and unitarity immediately imply a host 
of constraints on low energy amplitudes. One particular such constraint is that that the leading low 
energy forward scattering amplitude must be non-negative, yielding the same positivity condition 
on the higher- derivative interactions as the superluminality constraint. Of course the fact that 
analyticity and unitarity imply positivity constraints is very well known, and the connection of 
analyticity to causality is an ancient one. 

We will focus on models in which the UV cutoff is far beneath the (four-dimensional) Planck 
scale, so gravity in unimportant, though we will also make some comments about gravitational 
theories. Our work thus complements the intrinsically gravitational limitations on effective field 
theories recently discussed in pi ITU]. 

Of course, local quantum field theories have a Lorentz-invariant notion of causality and satisfy 
the usual S'-matrix axioms, so any effective field theory which violates our positivity conditions 
cannot be UV completed into a local QFT. Significantly, since weakly coupled string amplitudes 
satisfy the same analyticity properties as amplitudes in local quantum field theories — indeed, the 
Veneziano amplitude arose from S'-matrix theory — the same argument applies to weakly coupled 
strings. Thus, while string theory is certainly non-local in many crucial ways, the effective field 
theories arising from string theory are in this precise sense just as local as those deriving from 
local quantum field theory, and satisfy the same positivity constraints. 

Positivity thus provides a tool for identifying what physics can and cannot arise in the land- 
scape. Perhaps surprisingly, the tool is a powerful one. For example, it is easy to check that 
the DGP model violates positivity, providing a simple explanation for why this model has so 
far resisted an embedding in controlled weakly coupled string backgrounds. Similarly, certain 
4-derivative terms in the chiral Lagrangian are constrained to be positive, implying for example 
that the electroweak chiral Lagrangian cannot be UV completed unless the anomalous quartic 
gauge boson couplings are positive. 

The fiipside of this argument is that any experimental evidence of a violation of these positivity 
constraints would signal a crisis for the usual rules of macroscopic locality, causality and analyt- 
icity, and, almost incidentally, falsify perturbative string theory. For example, the DGP model 
makes precise predictions for deviations in the moon's orbit that will be checked by laser lunar 
ranging experiments JT] . If these deviations are seen and other pieces of experimental evidence 
supporting the DGP effective theory are gathered, we would also have evidence for parametrically 
fast superluminal signal propagation and macroscopic violation of locality, as well as a non-analytic 
S'-matrix, unlike anything previously seen in physics. The same conclusion holds if future colliders 
indicate evidence for negative anomalous quartic gauge boson couplings. Experimental evidence 
for either of these theories would therefore clearly disprove some of our fundamental assumptions 
about physics. 
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2 Examples 

Let's begin with some examples of the apparently consistent low-energy effective theories we will 
constrain. Of course we should be precise about what we mean by a consistent effective theory — 
loosely it should have stable vacuum, no anomalies and so on, but most precisely, a consistent 
effective field theory is just one that produces an exactly unitary 5*- matrix for particle scattering 
at energies beneath some scale A. 

Consider the theory of a single U{1) gauge field. The leading interactions in this theory are 
irrelevant operators, 

^ = —F.^F'^" + Y^{F,^^n' + Y^iF,.Fn' + ..., (1) 

with A some mass scale and ci,2 dimensionless coefficients. As another example, consider a massless 
scalar field tt with a shift symmetry tt — > tt-I- const. Again the leading interactions are irrelevant, 

C = d^nd^n + ^{d^nd'^nf + ... (2) 

As far as an effective field theorist is concerned, the coefficients Ci_2,3 are completely arbitrary 
numbers. Whatever the q are, they can give the leading amplitudes in an exactly unitary ^'-matrix 
at energies far beneath A. Of course the theories are non-renormalizable so an infinite tower of 

higher operators must be included, nonetheless there is a systematic expansion for the scattering 
amplitudes in powers of (E/A) which is unitary to all orders in this ratio. However, we claim that 
in any UV completion which respects the usual axioms of S'-matrix theory, the q are forced to be 
positive 



Ci>0. (3) 

It is easy to check that indeed these coefficients are positive in all familiar UV completions of 
these models. For instance, the Euler-Heisenberg Lagrangian for QED, arising from integrating 
out electrons at 1-loop, indeed generates Ci 2 > 0. Analogously, we can identify tt as a Goldstone 
boson in a linear sigma model, where n and a Higgs field h are united into a complex scalar field 

^^(v + h)e'''/'' , (4) 
with a potential = ''^(I^P — v'^T- The action for tt, h at tree-level is 

£ = (1 + ^)\d7r)' + {dhf - Mlh'' (5) 

Integrating out h at tree-level yields the quartic term 

Ces = ^,{dnr + ... (6) 

which has the claimed positive sign. 
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Another example involves the fluctuations of a brane in an extra dimension, given by a fleld 
y{x) with the effective lagrangian 



c^-fVT^m'=f[-i + ^^ + ^^ + ...] . (7) 

Again we find the correct sign. Related to this, the Born-Infeld action for a U{1) gauge field 
localized to a D-brane also gives the correct sign for all terms. 

There are also other simple 1-loop checks. For example, imagine coupling fcrmions to $ 
in our UV linear sigma model; for sufficietly large A^, 1-loop effects can dominate over the tree 
terms coming from integrating out the Higgs. For instance, consider integrating out a higgsed 
fermion. Grouping two Weyl fermions with charges ±1 into a Dirac spinor "if, the effective 
Lagrangian is 

"i7^(a^ + i^7')-M^]*. (8) 



At 1-loop, we generate an effective quartic interaction 

resulting again in a positive leading irrelevant operator. 

Note that the positivity constraints we are talking about are not directly related to other 
familiar positivity constraints that follow from vacuum stability. We know for instance that 
kinetic terms are forced to be positive, and that m^0^ and couplings must also be positive. In 
all these cases, the "wrong" signs are associated with a clear instability already visible in the low- 
energy theory. Related to this, the cuclidcan path integrals for such theories are not well-defined, 
having non-positive-definite euclidean actions. 

By contrast, the "wrong" sign for the leading derivative interactions (such as the (dn)^ terms 
above) are not associated with any energetic instabilities in the low-energy vacuum: the correct 
sign of the kinetic terms guarantee that all gradient energies arc positive, with the terms propor- 
tional to the Ci giving only small corrections within the effective theory. Indeed, even if the leading 
irrelevant operators — the only ones to which our constraints apply — have the "wrong" sign, higher 
order terms can ensure the positivity of energy (at least classically), e.g. higher powers of (c^tt)^. 
Related to this, the euclidean path integrals in theories with "wrong" signs do not exhibit any 
obvious pathologies. Of course this non-renormalizable theory must be treated Tising the stan- 
dard ideas of effective field theory, but the healthy euclidean formulation at least pcrturbatively 
guarantees a unitary low-energy ^-matrix when we continue back to Minkowski space. 



3 Signs and Superluminality 

If models with the "wrong" signs have stable, Lorentz-invariant vacuua with perfectly sensible 
and unitary perturbative (S-matrices, why don't they arise as the low-energy limit of any familiar 

UV-complete theories? As we will see, while the trivial vacua of such theories are well-behaved, 
the speed of fiuctuations around non-trivial backgrounds depend critically on these signs, with 
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the "wrong" signs leading to superluminal propagation in generic backgrounds. This in turn leads 
to familiar conflicts with causality and locality which are not present in any microscopically local 
quantum field or perturbative string theory. Exactly how this conflict arises turns out to be an 
illuminating question. 

Let's begin by establishing the connection between positivity-violating irrelevant leading inter- 
actions and superluminality in non-trivial backgrounds. Suppose we expand the effective theory 
around some non-trivial translationally invariant solution of the field equations. As long as the 
background field is sufficiently small, the effective field theory remains valid. Translational invari- 
ance ensures that small fluctuations satisfy a simple dispersion relation, lo^ — v'^{k) with the 
velocity v{k) determined by the higher-dimension operators in the lagrangian. The crucial insight 
is that whether fluctuations travel slower or faster than light depends entirely on the signs of the 
leading irrelevant interactions. 

Let's see how this works in an explicit example. Consider our Goldstone model expanded 
around the solution dfj^iro — C^, where is a constant vector. The linearized equation of motion 
for fluctuations (/? = tt — ttq around this background is 

d^^d.ip = . (10) 

Within the regime of validity of the effective theory, C^C^ <^ A^, all higher dimension interactions 
are negligible - all that matters is the leading interaction, C3. Expanding in plane waves, this reads 

fc% + 4^(C-A;f = 0. (11) 

Since {C -kY > 0, the absence of superluminal excitations requires that the coefficient C3 is positive. 

The case of the electromagnetic field is shghtly more involved — the speed of fluctuations around 
non-trivial backgrounds now depends on both momentum and polarization C/^, and thus on both 
of the leading interactions in the Lagrangian, 

ki^k^ + 32^{F''''k^e,f + ?,2^^{F^-k^e,f = , (12) 

but the conclusion is completely analogous: there exist no superluminal excitations iff the coef- 
ficients ci,2 are both positive. Note that these conclusions hold independently of the particular 
background fleld one turns on. Note too that even when the shift in the speed of propagation 
is very small, v — 1~^<^1, it can easily be measured in the low-cncrgy effective theory by 
allowing signals to propagate over large distances. It is interesting to note that in the case of 
open strings on D-branes, which are governed by a BI Lagrangian of the form (1), the speed of 
propagation in the presence of a background fleldstrength {F + B) 7^ can be computed exactly 
in terms of the so-called "open string metric" and is always slower than the speed of light - which 
is to say, this appearance of the BI Lagrangian in string theory satisfies positivity, with ci,2 > 0. 

At this point all the problems usually associated with superluminality — the ability to send 
signals back in time, closed timelike curves, etc. — rear their heads. On the other hand, such effects 
are appearing within a theory governed by a local Lorentz-invariant lagrangian, a hyperbolic 
equation of motion and a perfectly stable vacuum. It is thus instructive to work through the 
physical consequences of this kind of superluminality and understand exactly when and why these 
theories run into trouble. 



A4 



+ 



5 



3.1 The Trouble with Lorentz Invariance 



That the effective Lagrangian is Lorentz-invariance ensures that Lorentz transforms of solutions 
to the field equations are again solutions to the field equations. It does not, however, ensure 
that all inertial frames are on an even footing. Consider for example the equation of motion for 
fluctuations 9? around translationally-invariant backgrounds of our Goldstone model, 

- v'dl^ = , 

where ~ 1 — is the velocity of propagation. This has oscillatory solutions propagating in 

all directions, e.g. </? = j{x ± vt\ Upon boosting in, say, the x direction, the equation of motion 
becomes 

(1 - v'^') dl^ + 2/3(1 - v") dtd.ip - {v^ - P^) dl^ -v'dl^ = ^. 

whose solutions, e.g. (p = f{x i: j^^^t), arc the Lorentz boost of the original solutions. So far so 
good. However, if > 1, there exists a frame (/3 = 1/f) in which the coefficient of d^Lp vanishes, 
(fi propagates instantaneously and the equation of motion becomes a non-dynamical constraint. In 
this frame it is simply impossible to set up an initial value problem to evolve the field from Cauchy 
slice to Cauchy shce^. When /3 > 1/v, the equation of motion is again perfectly dynamical and 
can certainly be integrated — however, oscillatory solutions to these equations move only in the 
positive X direction, while modes in other directions may be exponentially growing or decaying. 
What's going on? How is it possible that what looks like a stable system in one frame looks 
horribly unstable in another? 

The point is that what look like perfectly natural initial conditions for a superluminal mode in 
one frame look like horribly fine-tuned conditions in another. Indeed, the time-ordering of events 
connected by propagating fiuctuations is not Lorentz-invariant. Observers in relative motion 
will thus disagree rather dramatically about what constitutes a sensible set of initial conditions 
to propagate with their equations of motion — initial conditions that to one observer look like 
turning on a localized source at some unremarkable point in spacetime will appear to the other 
as a bewildering array of fluctuations incident from past inflnity which conspire miraculously to 
annihilate what the original observer wanted to call the localized source. Said differently, the 
retarded Green function in one frame is a mixture of advanced and retarded Green functions in 
another frame. Fixing initial conditions on past infinity thus explicitly breaks Lorentz-invariance. 
In order for the theory to be predictive, we must choose a frame in which to deflne retarded Green 
functions. In sufficiently well-behaved backgrouds, there is a particularly natural choice of frame, 
that in which such conspiracies do not appear. 

Returning to our question of stability vs instability, consider a solution in the highly boosted 
frame in which we turn on a localized source for one of the unstable excitations. A Lorentz boost 
unambiguously maps this to a solution in the stable unboosted frame. The crucial point is that 
the resulting configuration does not look like a small fiuctuation sourccd by a local source — indeed, 
these are explicitly stable according to the equation of motion — but rather involves turning on 

""^Notice that we are dealing with tiny superluminal shifts in the dispersion relation, so wc need huge boost 
velocities to observe these effects, requiring both tt and Vtt to be of order A; however, since the Lorentz invariant 
combination dnird^w remains tiny, the description of the system in terms of the effective theory remains valid for 
all observers, ensuring that these effects obtain well within the domain of validity of effective field theory. 
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initial conditions at a fixed time which vary exponentially in space, along the slice. These do 
not represent instabilities in any usual sense; they simply represent initial conditions which we 
would normally rule out as unphysical. By the same token, a localized fluctuation which remains 
everywhere bounded and oscillatory in the original frame transforms into a miraculous conspiracy 
in the initial conditions that prevents the apparently unstable mode from turning on and growing. 
Crucially, this never happens in theories with null or timelike propagation, in which Lorentz 
transformations carry sensible initial conditions to sensible initial conditions. 

It is enlightening to run through the above logic in translationally non-invariant backgrounds. 
Consider again the Goldstone model with "wrong" sign, C3 < 0, and imagine building, by suitable 
arrangement of sources, a finite-sized bubble of S^vr = condensate localized in space and 
time. Let's begin in the rest frame of the condensate, in which = (C, 0,0,0). Outside the 
bubble, in the trivial tt = vacuum, fluctuations of vr satisfy the massless wave equation and 
propagate along null rays. Inside, however, fluctuations move with velocity f ^ ~ 1 — 
and thus propagate not along the light cone but along a "causal" cone defined by the effective 
metric G^,^ = rj^i, + Aj^d^nduH- When C3 < 0, this cone is broader than the light cone and 
fluctuations propagate ever so slightly superluminally (see fig. [T^). However, since fluctuations 
always propagate forward in time, setting up and solving the Cauchy problem in this background 
is still no problem. 

As above, when C3 < it is possible for the coefficient of the d^cp term in the equation of 
motion of a rapidly moving observer to vanish (see fig. ^p). Inside the bubble the coefficient 
of d^cp in the equation of motion is negative, while outside it is positive — somewhere along the 
boundary of the bubble, then, the coefficient must pass through zero, at which point the equation 
of motion becomes again a constraint. Thus, in any frame in which the causal cone deep inside the 
bubble dips below the horizontal, the bubble has a closed shell on which evolution from timeslice 
to timeslice cannot be prescribed by local hamiltonian flow. This in fact helps explain the peculiar 




Figure 1: Bubbles of non-trivial vacua, tt = C^x^, in our Goldstone model with C3 < 0. (a) In the rest 
frame of the bubble, Cf^ — (C, 0,0,0). The solid lines denote the causal cone inside of which small fluctuations 
are constrained to propagate, (b) The same system in a boosted frame in which the bubble moves with a large 
velocity in the positive x' direction. For sufficiently large boosts, the causal cone dips below horizontal, and small 
fluctuations are only seen to propagate to the left with a different temporal ordering than in the unboosted frame. 
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phenomena seen by this boosted observer. Consider the sequence of events depicted in fig. ^ An 
observer in the rest frame of the bubble sends a superluminal fluctuation from a point, A, deep 
inside the bubble to a point, B, on the boundary at which the wave exits the bubble, proceeding 
at the speed of light to a distant point, C. In a highly boosted frame, the sequence of events will 
have B happening before A or C. How is this possible? The resolution is that the coefficient of 
d^(p vanishes at B, so the evolution of ip aX B can't be predicted from local measurements; instead, 
a constraint requires the spontaneous appearance of two ip excitation just inside and outside the 
bubble, which then continue forwards in time to A and C. 

This is not something with which we are familiar, and makes it seem unlikely any Lorentz 
invariant S-matrix exists within such theories. Indeed, the existence of a prefered class of frames — 
those in which the field equations do not degenerate to constraint equations — suggests that the 
Lorentz invariance of the classical Lagrangian is physically irrelevant, and raises doubts about the 
possibility of embedding such effective theories in UV-complete theories which respect microscopic 
Lorentz invariance and locality. Notice that systems with superluminal propagation are in this 
sense somewhat analogous to Lorentz invariant field theories with ghosts, of which no sense can 
be made unless Lorentz-invariance is explicitly broken. This is because boost-invariance makes 
the rate of decay of the vacuum by ghost emission formally infinite — only if Lorentz-invariance 
is not a symmetry of the theory can the decay rate be made finite. In such systems, however, 
Lorentz-invariance can only arise as an accidental symmetry. 

3.2 Global Problems with Causality 

In the simple system of a single bubble in otherwise empty space, there always exists families of 
inertial frames in which causality is meaningfully defined. In particular, the co-moving rest frame 
of the bubble defines a time slicing in this 'good' class, so we can simply declare that evolution is 
to be prescribed in the rest frame of the bubble and translated into other frames by boosting with 
the spontaneously broken Lorentz generators. Forward evolution in time in highly boosted frames 
may look bizarre to a boosted inertial observer, but it is unambiguous. However, there are always 
backgrounds in which no global rest frame exists — for example, two bubbles of vr condensate fiying 
past each other at high velocity and finite impact parameter, as in fig. |2] — so it is far from obvious 
whether there is any good notion of causal ordering in these theories. 

It is useful to treat this problem with the aid of some formalism. Consider again the wave 
equation for small fiuctuations around a non-trivial background in the Goldstone system, 

C'd^d^ip = , G"" = 7]^"" + ^S^vra'^Tr . (13) 

This equation suggests a natural inverse-metric G^'^ with which to define ip "lightcones" and time- 
evolution ^. The metric G^^p is indeed what determines the light cone structure within the blobs 
in Fig. H Now that we have a metric, we can apply the methodology of General Relativity to 

^Strictly speaking the interpretation of G^u as an effective metric holds only in the geometric optics Umit in 
which the wavelengths are short enough with respect to the distance over which G^^ itself varies. Anyway, if a 
pathology arises already in this limit, and we shall see that it does, we do not need to worry about the case of long 
wavelengths. 
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Figure 2: Two finite bubbles moving with large opposite velocities in the x direction and separated by a finite 
distance in the y direction. The open cones indicate the local causal cones of Tr-fluctuations, and the red line the 
closed trajectory of a series of small fluctuations along these cones. Such closed time-like trajectories make it clear 
that no notion of causality or locality survives in a theory which violates positivity. 



determine whether causahty is meaningfully defined over our spacetime [TB|. A first requirement 
is that the spacetime be time orientable, meaning that there should exist a globally defined and 
non-degenerate timelike vector, t^. To see that this is the case, note that G^i, is 

4c3 

G^u = Vfj-iy - -j^d^T^d^ir + ... (14) 

where the dots stand for terms that can be neglected when d^Trd'^n / A"^ <^ 1 and the effective 
field theory surely makes sense. Then, for C3 < 0, we have Goo > 1 and therefore the vector 
= (1,0,0,0) is globally defined, non-degenrate and time-like. The vector defines at each 
space-time point the direction of time fiow. Future directed timelike curves x^{a) are those 
defined by 

xn'^G^^ > x^'i'^G^^ > . (15) 

The second condition for causality to hold is that there be no closed (future directed) timelike 
curves (CTCs). In the presence of CTCs, the t coordinate is not globally defined — it is multiply 
valued — and time evolution again becomes a constrained, non-local problem, and causality is lost. 

The Goldstone and the Euler-Heisemberg systems are both time orientable, at least for back- 
grounds within the domain of validity of the effective field theory description. Moreover, for 
simple backgrounds like the single bubble of Fig. ^ it is also evident that there are no CTCs, 
so that a sensible, although not Lorentz invariant, notion of causality exists. However, in both 
systems, there exist other backgrounds in which the effective metric G^^, does admit CTCs, and 
time evolution can not be locally defined but must satisfy globally constraints. 
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In our Goldstone system, a simple such offending background is given by two superluminal 
bubbles ffying rapidly past each other, as shown in Fig. ^ Note that a head on collision between 
the two bubbles in the same plane would certainly take us out of the regime of validity of the 
effective theory, with {dir)^ becoming large in the overlap region. But it is easy to check that a 
small separation in a transverse direction — the y direction in the figure — is enough to ensure that 
(Stt)^ can remain parametrically small everywhere in the background, and thus within the effective 
theory. Note that these pathologies only occur in backgrounds where (9^vr)^ passes through zero 
and goes negative — as long as {d^TiY > 0, we can always use vr to define a single-valued time-like 
coordinate. 

Another particularly nice example of such closed timelike trajectories involves the propagation 
of light in a non-trivial background of our "wrong" -signed Euler-Heisenberg system in eq. (Q). 
Consider a homogenous, static electromagnetic field with \E\ = \B\ and E ■ B = 0, such as might 
be found deep inside a cylindrical capacitor coaxial with a current-carrying solenoid, as depicted 
in fig. 121 Photons in this background moving orthogonal to the field, k (x E x B, and polarized 
along E, move with velocity 

1-Cif lip 
V = — ^— 

1 + Cif 

in the direction parallel to the current and f = 1 in the other. If ci < 0, photons in this system 
propagate superluminally. Moreover, as ci|||-Ep —1, the velocity of small fluctuations diverges 
as their kinetic term vanishes: this is the critical value of E for which the light cone of the effective 
metric at each point becomes tangent to the constant time slices of an observer at rest with respect 
to the solenoid. Finally, for ci|||i?p < —1 the forward light cone for the effective metric overlaps 
with the past of the static observer. In particular the cylinder's angular direction is at each point 
within the forward effective light cone, so that a circle between the cylindrical plates at fixed 
Lorentz time represents a CTC for the effective metric! Note that this configuration remains 
entirely within the effective theory, for while E,B A^, all local Lorentz invariants are small — 
indeed they are fine-tuned to vanish. Furthermore, the small fluctuations needed to probe these 
CTCs remain within the effective regime as long as their wavelengths remain large compared to 
A^^ As in the Goldstone example, violations of positivity lead to superluminality and macroscopic 
violations of causality. 

Note that we have been tacitly working with a single positivity-violating field. The situation 
is just as bad, and in some sense rather worse, if we include additional fields. In particular, we 
have relied heavily on the existence, for every configuration within the regime of validity of the 
effective theory, of a locally comoving frame in which the condensate is at rest, i.e. a frame in 
which all superluminal fluctuations propagate strictly forward in the local time-like coordinate. If 
we have two superluminal fields, this is generically impossible. 

Notice that attempting to define a global notion of causality, and a corresponding local Hamil- 
tonian flow, by working in a non-inertial frame — i.e. by working with the metric G in our intrisically 
flat spacetime — runs into problems when the non-inertial metric admits CTCs, since the affine 
parameter cannot be globally defined, so evolution is a globally constrained problem. Now, in 
GR, with asymptotically flat space, CTCs do not arise as long as the energy momentum tensor 
satisfies the null energy condition, i.e. if the matter action satisfies certain restrictions. It is a 
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Figure 3: The field between the plates of a charged capacitor coaxial with a current-carrying solenoid is of the 
form E — and B = Bz. When ci < 0, small fluctuations at fixed r propagate super luminally. For sufficiently 
large fieldstrengths, but still within the regime of validity of the effective field theory, the "causal cone" of small 
fluctuations dips below horizontal, allowing for purely spacelike evolution all the way around the capacitor at fixed 
t, a dramatic violation of locality and causality. 

remarkable fact that if the matter dynamics do not feature either instabihties or superluminal 
modes then the energy momentum tensor satisfies the null energy condition ^2]- Conversely, as 
soon as superluminal modes are allowed, the null energy condition is lost, even in the absence of 
instabilities within the matter dynamics ^21; and CTC's can in principle appear with respect to 
the gravitational metric Qf^i, as well. Therefore, whether gravity is dynamical or not, superluminal 
propagation generally leads to a global breakdown of causality. 

Another well-known energy condition closely related to superluminality is the dominant energy 
condition. It states that T^i,t^ should a be future directed time-like vector for any future directed 
time-like t^, i.e., there should be no energy-momentum flow outside the light-cone for any observer. 
This condition is trivially violated by a negative cosmo logical constant, as well as negative tension 
objects such as orientifold planes in string theory. To make it meaningful one must assume that 
the vacuum contribution is subtracted from T^,^. In this form the dominant energy condition 
follows from the absence of superluminality for a large class of systems. For instance, the sound 
velocity in a fluid is given by dp/ dp, and the dominant energy condition p < p follows from the 
absence of superluminahty dp/ dp < 1. For a single derivatively coupled scalar field the absence 
of superluminality for a general background requires the lagrangian to be a convex function of 
X = (c?^7r)^, C"{X) > 0. This is not the same as the dominant energy condition, which requires 
C'{X) ■ X — C{X) > 0. For small fluctuations around the trivial background with X = 0, 
these conditions agree, but for a general background, the absence of superluminality is a stronger 
condition. Thus the absence of superluminality is a more direct and fundamental requirement 
than the dominant energy condition. 
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3.3 The Fate of Fate 



What have we learned about physics in a Lorentz invariant theory which allows superluminal 
propagation only around non-trivial backgrounds? First, there is no Lorentz-invariant notion of 
causality. Second, for observers in relative motion, disagreements about time ordering can be 
traced to sharp violations of locality; in sufficiently simple backgrounds, both of these complica- 
tions can be avoided by a judicious choice of frame in which evolution is everywhere local and 
causal. Third, in more general backgrounds, attempting to foliate spacetime into (perhaps non- 
inertial) constant-time slices is obstructed by the existence of closed time-like trajectories, so that 
time-evolution can never be locally defined but is always globally constrained. 

Does this mean that effective theories which violate positivity are impossible to realize in 
nature? Not necessarily. Rather, since positivity-violating effective Lagrangians can in principle 
be reconstructed from experiments in completely sensible backgrounds, e.g. by measuring low- 
energy scattering amplitudes in well-behaved backgrounds, these phenomena can be interpreted 
as signaling the breakdown of the effective theory in pathological backgrounds. This is a novel 
constraint on effective field theories, which are normally thought to be self-consistent as long 
as all local Lorentz-invariants remain below a UV cutoff, so that UV-sensitive higher- dimension 
operators in the Lagrangian remain negligible — instead, these effective theories break down in 
the IR when local Lorentz-invariants get sufficiently small. An underlying theory could complete 
the IR physics in two distinct ways. One possibility is that the theory simply does not admit 
backgrounds where local Lorentz invariants can get arbitrarlily small — for instance, if the action 
contains terms with inverse powers of (Stt)^. This means that even the vacuum must spontaneously 
break Lorentz invariance, though local physics need not be violated. Another possibility is that 
the underlying theory is fundamentally non-local and capable of manifesting this non-locality at 
arbitrarily large scales, while remaining Lorentz invariant. In both cases, positivity provides an IR 
obstruction to a purely UV completion of such effective theories. Of course, no known well-defined 
theories, e.g. local quantum field theories or perturbative string theories, realize such macroscopic 
non-locality, so positivity provides an obstruction to embedding these effective field theories into 
quantum field or string theory. Any experimental observation of a violation of positivity would 
thus provide spectacular evidence that one of our most fundamental assumptions about Nature — 
macroscopic locality — is simply wrong. 

4 Analyticity and positivity constraints 

Interestingly, the UV origins of the IR pathologies we have found are visible already at the level of 
2 2 scattering amplitudes: with the wrong signs, these amplitudes fail to satisfy the standard 
analyticity axioms of S-matrix theory. To see why the UV properties of 2 ^ 2 scattering are 
relevant to superluminal propagation, it is illuminating to interpret the propagation of a fiuctuation 
on top of a background as a scattering process. The effect we have described corresponds to the 
re-summation of all tree-level graphs depicted in fig. |3 That the leading vertex is a derivative 
interaction implies a theoretical uncertainty of order 1/A on the position of the interaction, or 
equivalently on the position at which our fiuctuation emerges after having interacted with the 
background. This is because the derivative involves knowing the field at two arbitrarily close 
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Figure 4: Propagation of a small fluctuation around a background represented as a sequence of scattering events. 



points, but the closest we can take two points in the effective theory is a distance of order 1/A — 
the exact position is fixed by the microscopic UV theory. In a typical collision, any advance or 
retardation due to physics on scales smaller than the cutoff is thus unmeasurable in the low- 
energy effective theory. However, during propagation in a translationally-invariant background, 
many scattering events take place, each contributing the same super- or sub- luminal shift. Over 
large distances and after many scatterings, these small shifts add up to give a macroscopic time 
advance or delay that can be measured in the effective theory. This consideration makes it clear 
that the presence/absence of superluminal excitations is a UV question: it depends on the signs 
of non-rcnormalizable operators precisely because these interactions cannot be extrapolated down 
to arbitrarily short scales. 

In a local quantum field theory, the subluminality of the speed of small fluctuations around 
translationally invariant backgrounds follows straightforwardly from the fact that local operators 
commute outside the lightcone. Recall that, in a free field theory, while (vac|T((/)(x)0(y))|vac) 
is the Feynman propagator, (vac|[0(a;), 0(y)]|vac) determines the retarded and advanced Green's 
functions as 

(vac 0(|/)]| vac) = D^etix - y) - D^^{x - y) . (16) 

Therefore, the vanishing of the commutator as an operator statement, 

[0(x),0(y)]=O if {x-yf<Q, (17) 

implies that D^^i{x — y) vanishes outside the lightcone. 

Exactly the same logic holds in the interacting theory. The scalar particles are interpolated by 
some operator 0{x) in the full theory. The Fourier transform of (vac|0(a;)0(i/)|vac) has a delta 
function singularity on the mass shell in momentum space, and the pole structure is such that 
(vac| [0{x),0{ii)] |vac) is interpreted as £>ret(3^ ~ y) ~ Dgj^^^x — y), so that D^-^tix — y) vanishes 
outside the lightcone since the operator [0{x), 0{y)\ does. But exactly the same conclusion follows 
for any translationally invariant background \B) of the theory. Indeed, 

{B \[0{x), 0{y)]\ B) ^ D^x - y) - D^,^{x - y) , (18) 

where D^{x — y) represents the propagator for small fluctuations about the background \B). Thus 
again, D^^^{x — y) vanishes outside the lightcone. 

This argument may appear too quick — after all, our effective field theories with the wrong 
signs for the higher-dimension operators are local quantum field theories — what goes wrong with 
the commutator argument? The problem is precisely in the UV singularities associated with their 
only being effective theories. Due to the derivative interactions, the operator commutators aquire 
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UV singular terms proportional to derivatives of delta functions localized on the light-cone. These 
serve to fuzz-out the light cone on scales comparable to 1/A. Indeed, this is nothing but an 
operator translation of the argument at the end of last section, explaining how superluminality 
can arise as a result of a sequence of collisions with the background field. So it is crucial in the 
above argument that we are dealing with a UV complete theory, with no UV divergent terms 
localized on the lightcone in the commutators. 

The commutator argument is convenient when we have the luxury of an off-shell formulation 
as in local quantum field theories. But what happens if the UV theory is not a local quantum field 
theory, for instance if it is a perturbative string theory? The only observable in string theory is 
the S'-matrix. It is therefore desirable to see whether the positivity constraints we are discussing 
follow more generally from properties of the S'-matrix. 

Indeed, how is causality encoded in the S'-matrix? After all, when we only have access to the 
asymptotic states, it is not completely clear how we would know whether the interactions giving 
rise to scattering are causal or not. This was a vexing question to S- matrix theorists, who wanted 
to build causality directly into the axioms of S-matrix theory. In the end, there was no physically 
transparent way of implementing causality; instead, all the physical consequences of microcausality 
were seen to follow from the assumption that the S'-matrix as a function of kinematic invariants 
is a real boundary value of an analytic function with cuts (and poles associated with exactly 
stable particles) as dictated by unitarity. Of course it is unsurprising that microlocality should 
be encoded in analyticity properties — the textbook explanation for the absence of superluminal 
propagation in mediums like glass relies on the analytic properties of the index of refraction n(u;) 
in the complex frequency plane. 

As we will show momentarily, the positivity constraints on the interactions in the effective 
theories we have been discussing follow directly from the dispersion relation and the assumed 
analyticity properties of the S'-matrix. As such, our conclusions apply equally well to pertur- 
bative string theories, where the S-matrix satisfies all the usual properties — unsurprisingly, as 
the Veneziano amplitude arose in the framework of S'-matrix theory. It is of course elementary 
and long-understood that analyticity and dispersion relations often imply positivity constraints 
(though since such arguments are a little old-fashioned we will review them here in detail) — what 
is not well appreciated is that these positivity conditions can serve as a powerful constraint on 
interesting effective field theories. 

As a warm-up, let us understand why the coefficient of {dirY came out positive in two of 
our explicit examples — integrating out the Higgs at tree-level or fermions at 1-loop. At lowest 
order in the couplings the relevant diagrams are those depicted in fig. El and El Let's consider the 
amplitude for 2^2 scattering, M.{s,t). At leading order and at low-energies, M. is 



where u = —s — t. Of course this amplitude violates unitarity at energies far above A, and the 
theory needs a UV completion. 

Consider first the case where the theory is UV completed into a linear sigma model; the full 
amplitude at tree level is instead 




(19) 




,2 



.2 



2 



(20) 
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Figure 5: Analytic structure of the forward 2^2 scattering amplitude at tree level, in the theory of a Goldstone 
boson UV completed into a linear sigma model with Higgs mass M^. The poles arise from tree- level Higgs exchange 



and of course as s,t ^ oo, J\A{s,t) const. Let's further look at the amphtude in the forward 
direction, as t ^ 0, and define A{s) = A4{s,t 0); note by crossing symmetry A{s) = A{—s). 
The analytic structure of this amplitude in the complex s plane is shown in fig. 6. Now consider 
the contour integral around the contour shown in the figure 



ds A{s) 



(21) 



In the full theory, this amplitude has poles at s = ±M| from the s and u channel Higgs 
exchange. A{s) is bounded by a constant at infinity — more generally, as long as A{s) is bounded 
by |.4.(s)| < |sp at infinity, / = 0. On the other hand, / is equal to the sum of the residues of 
A{s)/s^ at its poles. Since A{s)/s^ = (c3/A'^)s~^ near the origin, there will be a contribution 
from a pole at the origin, as well as from the poles at s = ±M^. Thus, 

resA{s = Mri 

^ A^^ {Mlf ^ ' 

where the factor of 2 accounts for the pole at s = — M| since A{s) is even in s. In the simple 
example at hand the residue of ^ at s = M| is manifestly negative from eq. pO|l . and so C3 must 
be positive. However for the purpose of the future discussion it is useful to trace how positivity 
of C3 arises more generally from unitarity. Indeed, as s — 
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As) lmA{s) = -7:6{s-Ml)Tes[A{s = Ml)]. (23) 

Since by the optical theorem Im^(s) = sa{s) where a{s) is the total cross section for tttt scattering, 
we have 
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Figure 6: Analytic structure of the forward 2^2 scattering amplitude at tree level, with the Goldstone couplings 
arising from integrating out a Fermion at 1-loop. The cuts starting at s = ±4M^ correspond to \1/ pair production. 



which is manifestly positive since the cross section a{s) is positive. 

What about the case with the fermions integrated out at 1-loop? In this case, the analytic 
structure is shown in fig. 7. There is a cut beginning at s = ±4M| corresponding to ^ pair 
production, and extending to ±oo. Now consider again the contour integral / around the curve 
shown in the figure. Again, since A falls off sufficiently rapidly at infinity, / vanishes. As before, 
there is a contribution to / from the pole at the origin, together with 2 xl/(2ni) x the integral of 
the discontinuity across the cut disc[^(s)]/s^. By the optical theorem this is again related to the 
total cross section for tttt scattering, and we are led to the identical expression for cs as above. 

Of course this is not an accident. In fact the difference between the analytic structures of 
these amplitudes is completely an artefact of the lowest-order approximation. Let's consider the 
Higgs theory at 1 loop. The amplitude will now have a cut going all the way to the origin — the 
discontinuity across the cut refiecting the (tree-level) low-energy nn scattering cross section. The 
low energy cross section grows and becomes largest in the neighborhood of the Higgs resonance 
near s = Ml At 1-loop, we see the non-zero Higgs width T. As the physical region for s is reached 
from above (as per the ie presecription) , the resonance is seen since the amplitude takes the usual 
Breit-Wigner form oc (s — + iM/iF/i)"^. There is however no pole on the first or physical 
sheet in the complex s plane — the expected pole at s = — iMfiTh is reached by continuing the 
amplitude under the cut to the second sheet. Of course the presence of the resonance is visible on 
the physical sheet — by a big bump in the discontinuity across the cut in the vicinity of s = M^. 
This analytic structure is exhibited in fig. 8. Of course the analytic structure is the same for the 
full amphtude at all orders, and the ^ theory as well. In fact, this is the usual general structure 
of the forward scattering amplitude — analytic everywhere in the complex plane, except for cuts 
on the real axis (and poles associated with exactly stable particles). Narrow resonances appear as 
poles on the second sheet. 

Note that analyticity fixes C3 to be strictly positive for an interacting theory, rather than 
merely non-negative, as was motivated by the IR arguments of Section 3. Here, and in general, 
the constraints coming from UV analyticity are stronger than those observable in the effective 
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Figure 7: General analytic structure of the forward 2 — > 2 scattering amplitude. Poles associated with narrow 
resonances are reached by going under the cut to the second sheet. 



field theory in the IR. 

It is instructive to exphcity see how perturbative string theory satisfies the usual analyticity 
and positivity requirements. We can also see explicitly that an analogous argument also holds for 
perturbative string amplitudes. Let's consider the amplitude for gauge boson scattering in type 
I string theory in lOD. At lowest order in g^, this only involves open strings, and furthermore if 
we restrict the external gauge bosons to the Cartan subalgebra, the amplitude does not have any 
contributions from massless gauge boson exchnage. The scattering amplitude for gauge bosons 
with external polarizations ei_... ^4 in 10 dimensions has the form ^1] 



Mis,t) = gsK{ei 



T{~s)T{-u) , r(-t)r(-n) ^ n-s)v{-t) 



(25) 



T{l-s-u) T{l-t-u) r(l-s-t)_ 

where we are using a' = 1 units and K is given by 

-ft' = -| (s t ei ■ 64 62 ■ 63 + perm) + | (s ei ■ /C4 63 ■ ^2 62 • 64 + perm) . (26) 

If we take t — > and choose 63^4 = 61^2 in order to look at the forward amplitude relevant for the 
optical theorem, we find 

A^(s,t ^ 0) ^ s tanvrs . (27) 

This function is indeed well-behaved in the complex plane at infinity, and is in fact bounded by 
|A1(s, t — 0)1 < |s| away from the real axis. Thus the same arguments apply, and the coefficient 
of in the forward amplitude is guaranteed to be strictly positive. 

Our arguments are clearly general. Other than standard analyticity properties, all that was 
needed was that the forward amplitude be bounded by |sp at large In fact, under very general 
assumptions, unitarity forces the high-energy amplitude in the forward limit to be bounded by 
the famous Froissart bound |15t ITB] AA{s) < s In^ s, as follows. As s — * cxo with t fixed, the total 
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cross section is dominated by the exchange of soft particles at large impact parameter, so we can 
use the eikonal approximation to get 

M{s,t = -ql) ^-2is J d% e*^^-' (e'''^^''^) - l) . (28) 

Now as long as there is a mass gap, the phase shift should fall off exponentially with impact 
parameter, 5 ~ e~"^^f{s). Locality then suggests /(s) to grow no faster than a power law, 
f{s) ~ with a determined by the spin of the intermediate particles (e.g. for a single spin-J 
particle, a = J — 1). The forward amplitude is thus dominated by events with 6 = s°'e~^'^ of order 
1, i.e. impact parameters beneath b < m^^lns, bounding the amplitude as A4{s) < s s. So 
long as there is a mass gap, which can often be achieved by a mild IR deformation of the theory, 
a violation of the Froissart bound implies a dramatic and abnormal behavior of the theory in the 
UV, with amplitudes that grow faster than any power of s. It thus makes sense to study the low 
energy implications of a normal UV behavior which satisfies the Froissart bound. 

Let us finally give the general complete argument for positivity. For simplicity, we restrict our 
attention to a general scalar field theory with a shift symmetry vr ^ tt + const . The leading form 
of the low-energy effective Lagrangian is of the form 

C = {d^f + ^3 + + . . . (29) 

Note that there is a cubic interaction term — we have not assumed a vr ^ — tt symmetry — which 
might arise in a CP violating theory for which vr is the Goldstone. As we will discuss in the next 
section, the brane-bending mode of the DGP model is described by precisely this cubic interaction. 

The claim is that c must be strictly positive. More precisely, we will find a positivity constraints 
on the forward scattering amplitude A{s) = Ai{s,t 0). The argument is virtually identical 
to the one used in the above examples, with two additional technical subtleties. First, it is well- 
known that the Froissart bound can be violated by the exchange of massless particles, such as 
gauge bosons and gravitons, so we might worry that it will not hold for the scattering of our 
massless tt's, which would allow amplitudes to grow too rapidly at infinity for contours to be 
closed. Secondly, and relatedly, while all the non-analytic behavior of the lowest order amplitudes 
of our examples was associated with UV-completion physics, the exact amplitudes have additional 
cuts in the complex s-plane associated to pair-production of massless particles; in the absence of 
a gap, these cuts extend all the way to s = 0. 

To ensure that cuts from the exchange of massless vr particles do not modify the conclusion of 
positivity, we need to regulate the theory in the IR by giving a small mass m to the vr particles (see 
fig- El)- This also ensures that the Froissart bound is satisfied. The 2^2 scattering amplitude 
A4{s,t,u) is still symmetric in s, t, and u; however, we now have u = 4m^ — (s + 1), so that the 
forward amplitude A{s) = M{s, 0, 4m^ — s) is even around the point s = 2m?, and the s-channel 
and u-channel cuts associated to vr pair production extend on the real axis from Am? to +oo and 
from to — cxD, respectively (thin cuts in the figure). If the trilinear vertex a is non-zero, there is an 
additional contribution to the 2-^2 scattering amplitude coming from single vr exchange, leading 
to additional low-energy poles at the vr mass. However, given the large number of derivatives 
involved in the leading interactions, the residues of these IR poles scale like a positive power of 
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Figure 8: Analytic structure of the forward 2^2 scattering amplitude in the regularized massive theory 



m, and go to zero in the massless limit. Consequently, these poles disappear in the massless 
theory: in particular there is no divergence of the amplitude in the forward (t — > 0) limit. This 
is just a consequence of the fact that, despite the presence of massless particles, the amplitude is 
dominated by short- distance interactions. 

In the forward limit, then, the s-channel and u-channel low-energy poles are located at s = 
and at s = 3m^ (gray poles in the figure), and the cuts starting from to — oo and 4m^ to -|-oo. 

Since we have modified the theory in the deep IR by adding a mass term, we no longer want 
to probe the s — > limit of A{s), instead, we will probe the behavior of A{s) for s near an 
intermediate scale with -C <C A^. We will do this by considering the contour integral 

/ ds A{s) 

Note that is effectively acting as an "RG scale"; since A{s) becomes non-analytic as we 
approach the real axis, this is not a convenient place to probe the amplitude, so instead we will 
not put near the real axis but will instead consider Re(M^) ~ Im(M^). 

Now, / is given by the sum of the residues coming from the pole at s = M^, together with the 
poles near s = m?, 3m^. Since A{s) is bounded by the Froissart bound, once again contribTition 
to the integral from infinity can be neglected. The contribution from the discontinuity across the 
cuts is determined by the total cross section as before. Wc thus have 

Because of the derivative interactions, the second term above is suppressed by powers of /h?. 
Also, since at energies beneath A, a{s) grows at least as fast as s^/A^, for ^ A^ we have that 

[ , sais) f , sa(s) ^ , 

/ ds-. , = 2 / ds — h corrections of order powers of -r^ (32) 

Jcuts (S-M2)3 yeutat.>0 
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Thus we conclude that 



A"{s = M^) 




(33) 
(34) 



= positive up to power suppressed corrections 



So, said precisely, the forward amplitude A{s) away from the real axis, and for <^ s <S A^, is 
an analytic function in the complex plane. Its power expansion around any point Sq in this region 
must begin with a term of the form (s — SqY with a strictly positive coefficient. 

This is all we can say in complete generality. However, in theories where in addition to the 
dimensionful scale A there is a dimensionless weak coupling factor g so that M.{s,t) has an 
expansion in we can say more. Such theories include, for instance, weakly coupled linear sigma 
model completions of non-linear sigma models, where A corresponds to the Higgs mass and g is 
the perturbative quartic coupling in the UV theory, or perturbative string theories, where A is the 
string scale Mg and g is the string coupling gg. For s <^ A^, the tree amplitude in this theory is 
of the form 



Note that low-energy cuts, which are absent at leading order in g, appear at order g'^ precisely as 
needed for 1-loop unitarity. Thus, by considering the contour integral 



and running through the same argument (and now ignoring the contributions from low-energy 
cuts which don't exist at this order in g) we conclude 



Therefore, in a weakly coupled theory, there arc an infinite number of constraints on the effective 
theory: the leading ( in weak coupling g ) amplitude in the forward direction has an expansion as 
a polynomial in with all positive coefficients. For example, the forward scattering amplitude in 
the Goldstone model is 




(35) 



n=l 




(36) 



(37) 




(38) 



while the amplitude for gauge boson scattering in lOD type I string theory is 




(39) 



both of which of course have all positive coefficients. 
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5 The DGP Model 

The DGP model is an extremely interesting brane-world model which modifies gravity at large 
distances. In addition to gravity in a 5D bulk, there is a 4D brane localized at an orbifold fixed 
point with a large Einstein-Hilbert term localized on this boundary, with an action of the form 



S = 2Ml j (fx v^7^(^) + 2Ml j d^xdy TZ^^^ 

J branc J bulk 



(40) 



with M4 ^ M5. The large M| term quasi- localizes a 4D graviton to the brane up to distances 
of order Tc ~ Ml/M^ = 1/m, and at larger distances gravity on the brane reverts to being 5 
dimensional. 

Naively, this model makes sense as an effective field theory up to the lower of the two Planck 
scales M5. However, as in the case of massive gravity fTT], there is in fact a lower scale 

at which a single 4D scalar degree of freedom tt{x) — loosely the "brane-bending" mode — becomes 
strongly coupled jlH]- The classical action for this mode can be isolated by taking a decoupling 
limit as M4, M5 — > 00, keeping A fixed. In this limit both four and five dimensional gravity are 
decoupled and Vc 00 so the physics is purely four-dimensional, leading to the effective action 

C = 3{d7T)' - ^ ^3 . (42) 

The unusual normalization of the kinetic term is for later convenience. Note that the Lagrangian 
is derivatively coupled as expected for a brane-bending mode, and that the vr —>■ —it reflection 
symmetry is broken since the boundary is an orbifold fixed point. All the interesting phenomenol- 
ogy of the DGP model — including the "self-accelerating" solution (which is actually plagued by 
ghosts, as confirmed by a direct 5D calculation in ref. [19]) as well as the modification to the lunar 
orbit — actually follows from this non-linear classical Lagrangian with the scalar coupled to the 
trace of the energy momentum tensor for matter fields as {T^ ^/ M^t: [20] • Indeed, the non-linear 
properties of this theory are what allow it to be experimentally viable, at least classically. 

Now, for realistic parameters, the scale A corresponds to A~^ ~ 10^ km. If, at quantum level, 
all operators of the form 

are generated, then, despite the interesting features of the classical theory, the correct quantum 
theory would lose all predictivity at distances beneath 10'^ km ^H]- It is therefore interesting 
to consider loop corrections in this theory, as was initiated in ^H]; where it was shown that the 
tree-level cubic term is not renormalized. In it was shown that at loop level only operators 
of the form (d'^n)^ are generated, and with additional assumptions about the structure of the UV 
theory, [20] argued that the healthy classical non-linear properties of the theory survive quantum- 
mechanically. 
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Figure 9: Lowest order scattering amplitude in the DGP model. 



These results all follow from the fact that the form of the Lagrangian is preserved by a constant 
shift in the first derivative of tt, 

df^TT d^TT + . (44) 

Naively this suggest that any term in the Lagragian should involve at least two derivatives on 
every vr — however the variation of the cubic term in eq. ()42|) under this transformation is a total 
derivative, and therefore vanishes once integrated. The same holds for the kinetic term, (c^tt)^. 

This symmetry is nothing but 5D Galilean invariance. The position of the brane along the 
fifth dimension yhra.ne{x) is (in some gauge) related to the canonically normalized n{x) by ?/brane = 
^^^^ TT. The model enjoys of course full 5D Lorentz invariance, but in the decoupling limit in 
which vr is the only relevant mode, 

M5, M4 ^ cx) , A = const , (45) 

the brane becomes flatter and flatter, the 'velocity' 9^?/brane goes to zero and a 5D Lorentz trans- 
formation acts on i/brane as a Galilean transformation. This symmetry forces the Lagrangian to 
take the form 

C = 3(97r)2 - l^mr{dnf + 0{d'^{d\)'') , (46) 

that is all further interactions involve at least two derivatives on any tt. ^ 

Indeed, the absence of the (Stt)^^ terms is the only thing making this effective theory special 
in any sense. After all, a generic UV theory yielding a U{1) Goldstone boson vr, which violates CP 
(and hence the vr — vr symmetry), would have the same leading cubic interaction, which is the 
lowest order derivative coupling for a scalar. The only thing that can distinguish the DGP scalar 
Lagrangian from a generic Goldstone theory is the presence of the Galilean symmetry and the 
associated absence of {dnY^ type terms in the Lagrangian. And again, it is the absence of such 
(Stt)^^ terms in the effective action that gives it a chance for non-linear health and experimental 
viability. 

However, precisely this property of the theory makes it impossible to UV complete into a UV 
theory with usual analyticity conditions on the ^-matrix. As we saw in the last section, the 

■^Of course it is possible to make field redefinitions to eliminate the cubic interaction term, but the theory is not 
free, the tree-level 2 — > 2 scattering amplitude is non-zero. The field redefinition n — ip — -^{dcj))^ eliminates the 
DGP term but generates quartic terms of the form -j^ {d(j>)'^ {d^d^4>)'^ , as needed to reproduce the 2 — > 2 amplitude. 
However, the cubic form of the action is most convenient — first, because it makes the Galilean symmetry simply 
manifest, and second, because the coupling to matter is simple: a linear coupling of the form ttT/AI^ to the trace 
of the energy momentum tensor T. 
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coefficient of the (Stt)^ term, which gives rise to an term in the forward amphtude, must be 
strictly positive. Instead, in the DGP model, this operator is forced to vanish by the Gahlean 
symmetry. The amphtude for vr tt scattering has a tree-level exchange contribution from the DGP 
term (see fig. ^ as well as contributions from the higher order term, but they all begin at order 

M{s, t) = '-^^^^ + 0{s\ t\ u') (47) 

In the forward limit t 0, this amplitude vanishes; and in particular the piece proportional to 
vanishes identically, of course there will be some forward amplitude at even higher orders, 
but these will involve even more suppression by powers of A and there will be no piece. We 
conclude that it is impossible to complete an effective theory for a scalar with a shift symmetry of 
the form d^Ti — O^tt + into a UV theory with the usual analyticity properties for the S -matrix. 
Again, this includes any local quantum field theory or perturbative string theory. Conversely, 
any experimental indication for the validity of the DGP model can then be taken as the direct 
observation of something that is not local QFT or string theory. 

Associated with this, it is easy to see that signals about non-trivial vr backgrounds can travel 
superluminally. It is trivial to see that this is possible — the leading interaction term is cubic, and 
therefore around a background, the modification of the speed of propagation for small fiuctuations 
is linear in the background field and can therefore have any sign. And indeed simple physical 
backgrounds allow superluminal propagation, vr is sourced by T, the trace of the stress energy 
tensor. In the presence of a compact spherical object of mass M^,, vr develops a radial background 
7ro(r). The gradient of this solution is [201 



3A 



4r 



(4J 



where Ry = 1/A {M^^/M^y^^ is the so-called Vainshtein radius of the source. In such a Schwarzschild- 
like solution the quadratic action for the fiuctuation ip is 
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(49) 



where {dnipY is the angular part of (Vip)^. The speed c^ad of a fiuctuation moving along the radial 
direction is given by the ratio between the coefficient of {dr^Y and that of 0^ in the equation 
above; on the solution eq. (jlSjl c^^^^ is larger than 1 for any r! 

A plot of c^^^ versus r is given in fig. ^| it starts from 4/3 at r = 0, reaches a maximum of 
3/2 at r ~ Ry and asymptotes to 1 (from above!) for r oo. This is an 0{1) deviation from the 
speed of light in an enormous region of space; for instance for the Sun, Ry is ~ 10^° cm. Clearly 
highly boosted observers can observe parametrically fast propagation, and indeed if they boost 
too much they can observe the peculiar time-reversed sequence of events. It is also easy to find 
spatially homogeneous and isotropic background configurations for which even observers at rest 
can observe parametrically fast signal propagation. 
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Figure 10: The speed of radially moving fluctuations in a Schwarzschild-like solution in DGP. 

Having found superluminal propagation, we run into the same paradoxes as we discussed in 
section 2. For instance two blobs of vr field boosted towards each other in the x direction with a 
small separation in y give rise to the same closed timelike curve problems as in the two boosted 
blob Goldstone examples. However, while there we assumed the presence of suitable sources that 
could give rise to our paradoxical field configuration, here we expect something more. Since the 
simple Schwarzschild-like solution we just described features superluminal propagation, a closed 
timelike curve should appear in the vr field actually sourced by two masses boosted towards each 
other. This is not easy to check: a quick estimate shows that in order to close the closed timelike 
curve the two masses must pass so close to each other that, even if their Vainshtein regions do 
not overlap, the presence of one mass induces sizable non-linearities close to the other, and vice 
versa. In other words, the full solution is not just the linear superposition of two Schwarzschild- 
like solutions — new non-linear anisotropic corrections must be taken into account. It would be 
interesting to further investigate such a configuration and understand whether a closed timelike 
curve really arises. 

It is instructive to contrast this with what happens for a generic Goldstone theory, where 
the leading interaction is still the same cubic term, but we also have the (dn)^ terms. In the 
presence of a generic background field 7ro(x) this interaction gives a contribution to the quadratic 
Lagrangian for the fluctuations which is linear in the background, 

6C = ^{d,d,no - t/^^Dtto) • (50) 

If we turn on a background with constant second derivatives, then the field equation for the 
fluctuation ip is exactly of the form eq. (fTUI) . with C^C^ replaced by d^d^n^. Exactly as in 
the DGP analysis, it appears that superluminal signals are possible since d^dyiiQ has no a priori 
positivity property. However the {dnY term saves the day. We can certainly set up in some region 
a background with constant S^ttq and negligible Sttq, so that the effect of the cubic dominates over 
that of (Svr)^; but this region cannot be larger than L ~ ^JTi/d^TTo, since diTo grows linearly with 
X for constant d'^TCo, and after a while the (Svr)^ term starts dominating the kinetic Lagrangian of 
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the fluctuations. Once this happens, if the coefficient of (Stt)"^ is positive there are no superluminal 
excitations. 

The correction to the propagation speed inside the region where the cubic dominates is 5c ~ 
d^TCo/A^, so the maximum time advance/delay we can measure for a fluctuation traveling all across 
the 'superluminal region' is 

Stra..-- L5cr^ . (51) 

Now, we would normally require ^^ttq ^ A'^ in order for the effective theory to make sense. In 
such a case we immediately get 5tmax ^ l/A, too small a time interval to be measured inside 
the effective theory. However ^20^ argued that in a theory like eq. consistent assumptions 
about the UV physics can be made to extend the regime of validity of the effective theory to much 
larger background fields and to much shorter length scales. In particular, in the presence of a 
strong background field d'^iro ^ the effective cutoff scale is raised from A to A ~ \/Wtiq]~A. 
In this case too the superluminal time advance is unmeasurably small: the size of the region in 
which the effect of the cubic dominates over the quartic is of order of the UV effective cutoff, 
L ~ v^A/9%0 ~ A-i. In both cases the quartic saves the day. Thus, not only does the coefficient 
of the {dnY term have to be positive, it must be set by the same scale as the coefficient of the 
cubic term, a conclusion we could have also reached from the dispersion relation arguments of the 
previous section. 

We have uncovered a subtle inconsistency of the DGP model. As a classical theory, it has 
well-defined, two-derivative, Lorentz invariant equations of motion; this property underlies the 
healthy non-linear behavior of the theory and distinguishes it from more brutal modifications of 
gravity, such as the theory of a massive graviton. However, just as in the simple scalar field 
theory examples studied in the previous sections, which also have Lorentz invariant two-derivative 
equations of motion, the theory suffers from a lack of a Lorentz-invariant notion of causality, which 
is in turn related to a violation of the usual analyticity properties of scattering amplitudes. 

Of course, even in brane models respecting the usual UV locality properties, there are DGP 
terms induced on the brane. What we have shown is that we can not have a decoupling limit 
with M4/M5 — i> 00 holding MI/M4 fixed. This suggests that there is a limit of M4/M5 in any 
sensibly causal theory — it would be interesting to investigate these questions from the geometrical 
perspective of the five dimensional theory in more detail. 

There is also an interesting connection between our constraint on the DGP model and the 
"weak gravity" conjecture of [H]. Both situations involve trying to make some interaction much 
weaker than bulk gravity — in DGP it is the 4D Gravity on the brane, taking M4 ^ M5, while in 
[S], it is the attempt to keep Mpi and the cutoff of the theory fixed, but send l/gl 00. We 
have seen that a simple physical principle — requiring subluminal signal propagation — prohibits 
the DGP limit. Similarly, it appears that other general physical principles — such as the absence 
of global symmetries in quantum gravity — block taking the weak coupling limit. In both cases, 
there are obstacles to making any interaction physically weaker than bulk gravity. 
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6 Positivity in the Chiral Lagrangian 



There are similar positivity conditions in more familiar effective field theories in particle physics. 
Consider for instance the SU{2) chiral Lagrangian, parametrized by the unitary field U = e*'^"""", 

C = f tr(9^f/t9^f/) + L4 [tr(9^f/ta^f/)] ' + L5 [tT{d,U^dM)] ' + ■ ■ • (52) 

There is a solution of the equations of motion with vr pointing in a specifc isospin direction which 
we can take to be a^, of the form 

TT^ix) = C^X^ (53) 

We can look at the small fluctuations of both vr'^ as well as tt^ around this background. It is then 
easy to check that in order for both vr'^ and vr^ to propagate subluminally we must have 

U,^ > (54) 

In our previous Abelian examples, the 4-derivative terms were the leading irrelevant interactions 
in the theory, and so did not have any logarithmic scale dependence. On the other hand, L^^^ 
are logarithmically scale dependent; so the positivity constraint is then actually a constraint 
on the running couplings at energies parametrically smaller than A ~ 47r/. Indeed, we can 
imagine turning on a background where d^ir^ is approximately constant over a length scale R 
much larger than the cutoff scale; in order to avoid superluminality we should demand that that 
running L4 5 evaluated near this scale are positive. Of course, the log running of L4 5 induced 
off the lowest-order 2-derivative term pushes ^4^5 positive, and so in a theory without a weak- 
coupling expansion, ^4^5 at low-energies are dominated by the log running contribution and there 
is no interesting constraint on the UV physics. However in theories with a weak coupling g, the 
matching contribution to L4 5 at the scale A will dominate over the log-running contribution down 
to energies of order Ae~^^^ , and we can independently identify the matching contribution to L4 5 
from the high-energy physics from the low-energy running contribution, and hence the positivity 
bound is a non-trivial constraint. 

Naturally, the existence of these sorts of positivity constraints following from dispersion theory 
are very well known, though not said very explicitly; our present example was discussed (though 
perhaps not widely recognized) in the literature long ago [24]. 

Of course the pion chiral Lagrangian follows from QCD which is a local quantum field theory, 
so these conditions must neccessarily be satisfied. The situation is perhaps more interesting 
for the electroweak chiral Lagrangian governing the dynamics of the longitudinal components of 
the W/Z bosons. While it is most likely, given precision electroweak constraints, that the UV 
completion involves Higgses and a linear sigma model, there may also be more exotic possibilities, 
including in the extreme case a low fundamental scale close to the electroweak scale. This physics 
should manifest itself through the higher-dimension operators in the effective Lagrangian, and 
assuming custodial SU{2) is a good approximate symmetry, the constraint on the electroweak 
chiral Lagrangian is the same L^^^ > (with the derivatives covariantized for the SU{2) x U{1) 
gauge symmetry 9^ D^). These operators are not associated with the well-known constraints 
of precision electroweak physics — instead, in unitary gauge [/ = 1, they represent anomalous 
quartic couplings for the W/Z, which must be positive. 
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7 Examples from String Theory 



7.1 Little String Theory 

As we have seen, UV theories which are local or, what is the same, satisfy the usual analyticity 
properties of S'-matrix theory, give rise to effective theories with positivity constraints on certain 
leading irrelevant interactions that forbid superluminality and macroscopic non-locality. If we 
experimentally measure such interactions and find that they are zero or negative, then we have 
direct evidence for a fundamentally non-local theory. But if we also happen to know some of 
these operators theoretically, on other grounds, we can use them as a locality test for the UV 
completion. 

The prime candidate for such a test is of course M-theory, which does not have a weakly 
coupled description, and is thought by many to be fundamentally non-local. However, as we saw 
in the last section, in gravitational theories, there is no well-defined way to extract information 
about higher-dimension operators from superluminality constraints, since the notion of the correct 
metric to use for the GR lightcone can be modified by higher-dimension operators, while just 
gravity already bends all signals inside the underlying Minkowski lightcone. Associated with 
this, gravitational amplitudes are dominated by long-distance graviton exchange in the forward 
direction, with t-channel poles, and the dispersion relation arguments can't be used. 

However, we can certainly study non-gravitational UV completions of higher-dimensional gauge 
theories, especially supersymmetric ones. In five dimensions, maximally supersymmetric Yang- 
Mills theories are UV completed into the six dimensional (2, 0) superconformal theory, which 
although mysterious is still a local CFT. On the other hand, 6D super- Yang Mills is UV completed 
into the 6D little string theory, which is a non-gravitational string theory with string tension set 
by the 6D Yang-Mills coupling but no small dimensionless coupling. This is another candidate 
for a "non-local" theory. This issue can be probed if we can determine the coefficient of the 
operators in the low-energy SYM theory — if any of them have the "wrong" sign, this would prove 
that the LST is dramatically non-local. 

Some of these terms have in fact been determined by a variety of methods. For instance, 
the U{N) theory has a Coulomb branch where it is Higgsed to U{1)^ , and far out along the 
Coulomb branch, where the W^s are much heavier than the little string scale, the coupling 
between the [/(l)'s have been computed |2lj. They are all positive. This is perhaps not surprising, 
since they are related by dualities to coefficients in weakly coupled theories where the signs 
are determined. Thus, if LST is really non-local, the non-locality did not put in an appearance in 
the F^ terms. 

7.2 Non-commutative theories 

Probably the best studied example of the Lorentz-violating system in string theory is a stack of 
parallel D-branes in flat space-time with constant non-zero value of antisymmetric tensor field 
Bfj^u along the brane world-volume. It is instructive to see how superluminality constraints work 
in this case. In the presence of non-zero 5^,^ open string modes localized on D-branes propagate 
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in the effective metric G^u related to the closed string metric 77^1, in the following way [22]) 

Gf^u = - {27ra'YB^xB^ . (55) 

The metric G^j. is a direct analogue of the effective metric discussed in sect. 3.2, so we expect its 
causal cone to be contained in the light cone of the Minkowski metric. To check this let us pick up 
an arbitrary light-like vector n'^ = (l,n*) and calculate its norm with respect to the open string 
metric G^^. One finds 

n''G,,,n'' = -{E, - mE^n^ - Bi^n^f 

where Ei = Bqi. Consequently, n^^ is either space-like or null with respect to the open string 
metric, implying that its causal cone is indeed contained in the usual light cone. Note that this 
conclusion holds not only for small values of B^^^, but in the presence of strong field as well. For 
the background of the electric type one should of course require that the electric field is smaller 
than the critical value when metric G^i, changes its singature. Physically at this point vacuum 
becomes unstable towards pair production of massive charged string modes. 

In the limit of large magnetic field the propagation velocity of open modes vanishes as compared 
to the speed of closed modes. In this limit closed strings decouple from open modes and dynamics 
of the open sector is described by the non-commutative field theory. At the classical level this 
theory exhibits approximate Lorentz-invariance (with open string metric playing the role of the 
Lorentz metric) which is broken by the higher-dimensional operators proportional to the non- 
commutativity parameter. Interestingly, these theories allow soliton solutions which can propagate 
faster than the speed of "light" as defined by the open string metric [2H1- This fact however, 
neither contradicts to our superluminality constraints nor leads to any problems with causality 
because these solitons still propagate inside light cone of the closed string metric which is the true 
Lorentz metric of the underlying theory. 

8 Gravity 

So far, we have been discussing non-gravitational theories. In a theory with gravity, there is a 
natural UV cutoff scale of order Mpi, and it is natural to ask whether there are constraints on 
1/Mpi suppressed operators from our considerations. 

It is easy to see that there can't be any straightforward analog of our superluminality con- 
straints in GR. The reason is that in a gravitational theory, it is natural to define the light-cone 
by Qfj^^. The effect of any higher dimension operator on the GR lightcone can then completely be 
absorbed into a redefinition of the metric by 1/Mpi suppressed operators. There is also no analog 
of our arguments using the vanishing of commutators of local operators outside the light-cone. 
There are no local gauge invariant operators in gravity — one of the reasons the only observable in 
a quantum theory of gravity in fiat space is the S'-matrix. 

However, we might take another track. For weak enough gravitational fields we can certainly 
think of General Relativity as a Lorentz invariant theory of a spin-2 field interacting with matter 
fields in Minkowski space. The underlying Minkowski space has a metric rjf^iy and a well defined 
light-cone, which we shall refer to as the "Minkowski light-cone". On the other hand a classical 
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gravitational field defines a new light-cone, the cone of null geodesies of the full metric f?^,^ irra- 
diating from a point; we call this the "gravity light-cone". In the weak field approximation we 
are considering, the fact that massless particles propagate along the gravity light-cone takes into 
account the interaction of these particles with the background gravitational field. This is very 
similar to what we have been discussing so far, the propagation of signals in a Lorentz-violating 
background field. Indeed we can ask whether it is possible to turn on a gravitational field such 
that the Minkowski light-cone lies inside the gravity one, so that massless excitations, interact- 
ing with the background gravitational field, can actually travel outside the underlying Minkowski 
light-cone. 

At first this seems to be trivially possible, since the effect the gravitational field has on the 
dispersion relation of a massless particle is linear in the background field itself, and as such it has 
no a priori definite sign. In fact in the geometric optics limit the dispersion relation is simply the 
statement that the particle wave-vector /c^ is null with respect to the full metric gf^i, = 77^,^ + h^y, 

iv^" - h>^n k^K = . (56) 

If hfj_i, is not a negative-definite matrix, then there exists a fc^ obeying the above equation that 
is time-like with respect to the underlying Minkowski metric, which means that the particle 
can travel outside the Minkowski light-cone. For instance a plane gravitational wave has a non 
negative-definite h^^. 

However we must wonder how we actually set up the background gravitational field. For solving 
Einstein's equations we need to fix the gauge. Since we want to preserve Lorentz invariance, we 
choose De Bonder gauge, d^{h^y — ^ hrj^^) = 0. Einstein's equations then read 

0{h^y - \ hrif,y) = -WnGT^y , (57) 

where T^,^ is the sources' stress-energy tensor. Outside the sources we can further constrain the 
gauge by setting h = 0. The retarded gravitational field ouside the sources is therefore 

V(i, x) = -AG j dh i T^,{t - r, X + r) . (58) 

Now it is clear that h^^ k^k'^ can be made positive only if T^^, k^k^ is negative somewhere, but this 
is in contradiction with the Null Energy Condition. Notice that a violation of the Null Energy 
Condition under very broad assumptions leads either to instabilities at arbitrarily short time-scales 
or to superluminal propagation in the matter sector ^2]- The negativity of h^^ physically means 
that even if gravitational waves are present, the contribution of the static Newtonian potential to 
h^jj due to the very same sources that emitted the gravitational waves is always larger than the 
oscillatory one, and negative definite.^ 

We are therefore led to the conclusion that, in the weak field approximation we are in, it is 
impossible to set up a gravitational field such that null geodesies move outside the underlying 
Minkowski light-cone. Gravity 'bends' all null trajectories inside the Minkowski light-cone. This 
conclusion was also reached a number of years ago in [22] ■ 

''At first this seems to contradict the usual fact that far from the source the wave field decays slower than the 
static one. This is indeed true at the level of field-strenghts, but the potentials themselves both decay like 1/r. 
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This is a satisfying result. However, it also makes it impossible to constrain higher- dimension 
operators using superluminality arguments — since the Einstein action already pushed signals inside 
the Minkowski light-cone, higher-order operators can't change this conclusion regardless of their 
sign. 

The difficulty of bounding higher- dimension operators suppressed by Mj>i is also apparent in 
discussing 2^2 scattering amplitudes. From our previous experience, it is evident that there 
arc positivity constraints on 2 — 2 scattering in the forward direction t ^ 0. However, in a 
gravitational theory, already at tree-level, massless 1-graviton exchange dominates the 2 — > 2 
scattering amplitude as t ^ 0, 

Mis,t) r-. Gnj , (59) 

so again, the effect of any higher order operators are swamped by the lowest order massless 1- 
graviton exchange which dramatically violated the Froissart bound. 

In the context of a UV theory with a weak-coupling factor, however, there is some hope. 
For instance, consider weakly coupled string theory, and consider the amplitude for massless 
closed string mode scattering. The lowest order tree amplitude has a contribution from graviton 
exchange, as well as from the tower of Regge states. We can isolate the contribution from Regge 
states simply by subtracting the tree-level graviton exchange diagram. The resulting subtracted 
amphtude then has a well-behaved limit as i — > 0. On the other hand, the Regge behavior of the 
full amplitude as s — > oo for fixed t tells us that the amphtude behaves as s"^*^* large s, and 
therefore, upon subtracting the tree- level graviton contribution that removes the t-channel pole, 
the amplitude is polynomially bounded in the complex s plane. Thus, we should expect that as 
t — > the leading amplitude is of the form 

M.{s,t) Gn h polynomial in with all positive coefficients , (60) 

t 

which again implies an infinite number of constraints on the coefficients of higher-dimension op- 
erators in the theory. 

Let us see how this works explicitly for the scattering of NS-NS bosons of type II strings in 10 
dimensions. The full scattering amplitude is of the form 

Jvl[s,t,u) — K— r^- — — — 1 — r (61) 

where once again i^' is a factor depending on external polarizations; for our purposes it is only 
important that as i ^ 0, X ~ s'^. The amplitude can be expanded as 

M = M^^'' + Al^^sge ^ ^g2) 

where 

stu 

is the tree-level graviton contribution, while M^^^^"^ is by definition the contribution to the tree- 
level amplitude from the heavy Regge modes. Of course M^^^ has a good behavior as t ^ 0, 
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and it is easy to check that it is indeed polynomially bounded at infinity; in fact it is bounded by 
|s|^ at large s. From our general argument, all the terms beginning with are guaranteed to be 
positive, and indeed 

t^O)^ -Mi) + ^ / + ^ s^ + ... (64) 

where all the polygamma functions '?/'„(l) arc negative. 

Thus, while we can't say anything in general about higher-order operators in gravitational 
theories, there is a simple diagnostic for whether certain gravitational amplitudes do not come 
from a weakly coupled string theory — if the short-distance contribution to the amplitide is a 
polynomial in with any negative coefficients, it can't come from a perturbative closed string 
model. 



9 Discussion 

We have shown that certain apparently consistent effective field theories described by local, 
Lorentz-invariant Lagrangians are secretely non-local. The low-energy manifestation of this lurk- 
ing non-locality is the possibility of superluminal signal propagation around coherent background 
fields. This creates a tension between causality and Lorentz-invariance: in such theories no 
Lorentz-invariant notion of causality or locality exists. The high-energy face of this tension is 
that such theories can not be UV completed into full theories that satisfy the usual axioms of 
S'-matrix theory, specifically the analyticity conditions that encode locality. Both local quantum 
field theories and string theories satisfy these properties, so we have provided a simple diagnostic 
for theories that can not be embedded into local QFT's and string theory. 

In effective theories where the particle content or symmetries force the leading interactions to be 
irrelevant operators, completely standard dispersion-relation arguments force positivity conditions 
in these interactions. In terms of the scattering amplitude, the coefficient of the leading term 
in the forward amplitude must be strictly positive. This requirement precisely guarantees the 
absence of superluminal excitations around coherent backgrounds. In weakly coupled theories, 
there is the stronger constraint that the leading amplitude is a power series in with all positive 
coefficients. 

Our analysis applies to the DGP model — specifically the four- dimensional effective theory for 
the "brane-bending" mode tt that is all that is left in a decoupling fimit sending M4, M5 — > 00 
keeping A = Af|/Afi fixed. This scalar theory is controlled by a nice symmetry — interpreted as 
Galilean invariancc in the underlying 5D theory, under which d^n — * 9^7r + c^. This symmetry 
forces the coefficient of the leading operator giving rise to an amplitude proportional to to vanish, 
in contradiction to the strict positivity of this coefficient in any UV local theory. Associated with 
this, signals about consistent tt backgrounds can propagate parametrically faster than light. 

Similar considerations apply to the chiral Lagrangian, where the coefficient of some of the 
4-derivative terms are forced to be positive. In the electroweak chiral Lagrangian, these turn into 
anomalous W/Z quartic gauge couplings in unitary gauge. Thus, UV locality implies that such 
anomalous couplings are necessarily positive. 
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All of our considerations in this paper have concerned Lorentz-invariant theories with a Lorentz- 
invariant ground state. It is the presence of the asymptotic Minkowski space that allows us to get 
into paradoxes involving highly boosted bubbles. While our arguments do not directly apply to 
theories in which the vacuum spontaneously breaks Lorentz invariance, such as Higgs phases of 
gravity [HI [Zl E] or the models studied in [T2] , it would be interesting to ask whether there are any 
analogous constraints to those we have discussed. 

The UV face of the positivity constraints we have described are of course implicitly well-known 
in dispersion theory. In the context of the chiral Lagrangian, precisely the positivity constraints 
we describe have been discussed for instance in pil. However, the directness with which these 
important signs pinpoint effective theories that can or can not arise from local UV theories, and 
the connection with superluminality and the breakdown of macroscopic locality, has not been 
appreciated. For instance, any experimental support for the DGP model, or negative signs for 
anomalous quartic gauge boson couplings, can be taken as a direct indication of the existence of 
macroscopically non-local physics unlike anything ever seen in physics, either in quantum field 
theory or weakly coupled string theory. Experimental tests of positivity then provide a powerful 
probe into the validity of some of our most firmly held assumptions about fundamental physics. 

Note that our results are quite likely to have interesting connections to the physics of horizons. 
For example, in theories violating positivity, a Rindler observer immersed in the translation- 
ally invariant superluminal background can see behind the nominal Rindler horizon to see all 
of Minkowski space. Similarly, it should be possible to see behind black hole and cosmological 
horizons by tossing superluminal bubbles at them, which runs afoul of all the usual rules about 
horizons and their thermodynamic properties. Another concrete link between positivity and black- 
hole physics was developed in It would be interesting to further explore such connections. 



Similarly, it would be interesting to apply positivity to various models under consideration 
to identify which can be embedded in local UV theories, and which are in fact fundamentally 
non-local. Obvious candidates are various beyond-the-standard-models with leading derivative 
interactions, as well as the host of interesting varying speed-of-light models of interest in cosmology, 
many of which very likely run afoul of positivity. 

More generally, we have studied only one very simple constraint on the low-energy effective 
field theory of a local, Lorentz-invariant UV-complete Quantum Field Theory deriving from the 
analyticity of the UV S'-matrix, namely, positivity of forward 2^2 bosonic scattering ampli- 
tudes. For example, it would be interesting to identify constraints deriving from higher scattering 
amplitudes, amplitudes involving fermions, gravitational amplitudes, etc. It would also be inter- 
esting to identify other properties of UV-complete models above and beyond simple analyticity of 
the S'-matrix — do they lead to interesting constraints on which effective field theories may be UV 
completed, and if so, what is the IR pathology of a theory which runs afoul of such a constraint? 
It seems possible that there are many more constraints on just which effective theories may be 
consistently embedded in UV-complete theories, and, in particular, string theory. 
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